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ENHANCING VIDEO IMAGES DEPENDING ON PRIOR IMAGE ENHANCEMENTS 

The invention relates to the field of video image processing and more specifically 
to enhancing subsequent images of a video stream in which frames are encoded based on 
5 previous frames using prediction and motion estimation. 

Those skilled in the art are directed to US6259472 and US5862254 which describe 
enhancing of video images. These citations are hereby incorporated herein in whole by 
reference. 

In the invention herein, a video stream containing encoded frame based video 
1 0 information is received. The video stream includes an encoded first frame and an encoded 
second frame. The encoding of the second frame depends on the encoding of the first 
frame. More specifically, the encoding of the second frame includes motion vectors 
indicating differences in positions between regions of the second frame and corresponding 
regions of the first frame, the motion vectors define the correspondence between regions of 
15 the second frame and regions of the first frame. 

The first frame is decoded and a re-mapping strategy for video enhancement of the 
decoded first frame is determined using a region-based analysis. Regions of the decoded 
first frame are re-mapped according to the determined video enhancement re-mapping 
strategy for the first frame so as to enhance the first frame. 
2 0 The motion vectors for the second frame are recovered from the video stream and 

the second frame is decoded. Then regions of the second frame, that correspond to regions 
of the first frame, are re-mapped using the video enhancing, region-based, re-mapping 
strategy for the regions of the first frame so as to enhance the second frame. 

The reuse of the video enhancing re-mapping strategy of previous frames for 

2 5 subsequent frames greatly reduces the processing required for providing video 

enhancements. 

In a further aspect of the invention one or more regions of the second frame are 
selected depending on whether a similarity criteria is met for a similarity between the 
regions of the second frame and corresponding regions of the first frame. Then the re- 

3 0 mapping of the regions of the second frame based on the video enhancing region-based re- 

mapping strategy for the first frame is only performed for the selected regions of the 
second frame. 
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Limiting the reuse of the video enhancing re-mapping strategy of previous frames 
to only regions of the subsequent frames that are sufficiently similar to the previous frame, 
increases the likelihood that the subsequent frame will be enhanced. 

A set top box using the decoder of the invention provides enhanced video pictures 
5 with minimal additional hardware costs. Using the decoder of the invention for a video disc 
player allows higher compression of groups of pictures on the video disc with the same 
perceived quality. A television that uses the decoder of the invention can display higher 
quality video pictures or utilize a more highly compressed video signal while providing the 
same quality as a less compressed signal. 
1 0 Additional aspects and advantages of the invention will become readily apparent to 

those skilled in the art from the detailed description below with reference to the following 
drawings. 

Figure 1 illustrates an example method of the invention for region-based enhancing 
of subsequent video images. 
1 5 Figure 2 shows portions of an example decoder of the invention for providing 

region-based enhanced subsequent video images. 

Figure 3 shows portions of a example set top box using the decoder of figure 2. 

Figure 4 illustrates portions of an example DVD player using the decoder of figure 

2. 

2 0 Figure 5 shows portions of an example television using the decoder of figure 2. 

In the following descriptions of the drawings, the same labels in different figures 
indicate similar devices. For convenience, such devices will only be described in detail in 
relation to the earliest described figure in which they appear. 

Figure 1 shows a specific embodiment 100 of the method of the invention. In 102 a 

2 5 video stream is received. The stream contains encoded information for groups of pictures 

(GOP), the first picture in the GOP is an intra-coded frame (I-ftame) and subsequent 
pictures in the GOP are non-I-frames. The decoding of the subsequent non-I-frames 
depends on the coding of the I-frame. The video stream may be, for example, an MPEG II 
stream of packets, in which case, the non-I-frames may be, for example, predicted frames 

3 0 (P-frames), and/or bi-directional frames (B-frames). However, any other type of GOP 

based video stream may be used as long as it contains subsequent frames that are encoded 
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based on previous frames. In 104, the I-frame is decoded. Decoding of I-frames is well 
known in the art. 

In 106, a re-mapping strategy for re-mapping the intensity values to adjust the 
contrast is determined so as to enhance the decoded I-frame. The re-mapping strategy may 
5 use a region-based intensity analysis. Methods of determining re-mapping strategies for 
regions of decoded frames using such analysis are well known, and those skilled in the art 
are directed to US6259472 and US5862254 which disclose such re-mapping of intensity 
values. In 108, the intensity values of the decoded I-frame are re-mapped according to the 
determined re-mapping strategy. 

10 In 108, motion vectors for the subsequent non-I-frame are recovered from the video 

stream as is well known in the art. Generally, motion vectors are differences in the 
positions between regions in an I-frame and corresponding regions in a non-I-frame that is 
coded dependent on the I-frame. The regions may be regions of similar intensity or regions 
of similar texture or any other predefined similarity between frames may be used to define 

15 regions. 

In 1 10, DC coefficients for the subsequent non-I-frame are recovered from the 
video stream as is well known in the art. Generally, the DC coefficients are the differences 
between the values of the image blocks of the I-frame and the predicted values of 
corresponding image blocks of the non-I-frame, after motion estimation. Motion estimation 
2 0 is generally, the re-mapping of the regions depending on the motion vectors during 
decoding. 

In 1 12, the intensity values in the regions of the non-I-frame are re-mapped 
depending on the re-mapping strategy of corresponding regions of the I-frame so as to 
adjust the contrast to enhance the non-I-frame. The correspondence between the regions is 

2 5 determined from the motion vectors. 

If a region of the subsequent non-I-frame is more similar to the corresponding 
region of the I-frame (on which the decoding of the non-I-frame depends), than it is more 
likely that using the re-mapping strategy, developed for re-mapping the intensity values of 
the corresponding I-frame region, for re-mapping the intensity values for the non-I-frame 

3 0 region, will enhance the non-I-frame. On the other hand, if the region of the non-I-frame is 

substantially different than the corresponding region of the I-frame, then using the intensity 
value re-mapping strategy for the corresponding region of the I-frame, for re-mapping the 
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intensity values of the region of the non-I-frame, is not likely to enhance the non-I-frame, 
and in fact may even reduce the quality of the non-I-frame. 

Using the strategy for re-mapping the I-frame for enhanced contrast for re-mapping 
subsequent frames greatly reduces the overhead required for contrast enhancement. 
5 Generally any region-based video processing for improving the quality of an I-frame can 
be applied to corresponding regions of subsequent non-I-frames in a similar maimer. 

The re-mapping of the intensity values of the non-I-frame may also depend on the 
DC coefficients of the blocks of the regions of the non-I-frame on which the decoding of 
the non-I-frame depends. Generally, small values of the DC coefficients for a region 

1 0 indicate that the region is likely to be similar to the corresponding region in the I-frame 
after motion compensation. Thus, when its determined the DC coefficients are relatively 
large, then the re-mapping strategy for the intensity values of the I-frame are not used to re- 
map the intensity values of the non-I-frame. This can be determined by using a threshold 
for the DC coefficients, which can be a constant predetermined value or a variable value 

1 5 calculated for each region, and then using the I-frame re-mapping strategy to re-map the 
intensity values for a region only when the value of the DC coefficients are below the 
threshold. Those skilled in the art can easily determine either a standard predetermined DC 
coefficient threshold for regions in a frame, or a method to calculate a DC coefficient 
thresholds for each region in a frame, that can be used to enhance the frames. A useful DC 

2 0 coefficient threshold can be determined, for example, by a simple trial and error process of 
comparing frames in which different thresholds or threshold calculation methods have been 
applied. 

In addition, the re-mapping of the intensity values of the non-I-frame may also 
depend on the properties of the motion vectors. As discussed above, the motion vectors are 

2 5 used to identify regions of the subsequent non-I-frame, that correspond to regions of the I- 

frame, in a process called motion compensation. However, in addition to their use in 
motion compensation, the properties of the motion vectors can also be used to determine 
the likelihood that the regions of the non-I-frame are similar to the corresponding regions 
of the I-frame. 

3 0 Each motion vector has a value and a direction. Relationships between the motion 

vectors of neighboring regions include differences in values and differences in direction 
called orthogonality. Generally, for a non-I-frame, small values for the motion vector for a 
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region, small differences between motion vector values of a region and its neighboring 
regions, and small differences between motion vector directions of a region and its 
neighboring regions, each indicate that the region is more likely to be similar to the 
corresponding I-frame region. 
5 Generally, a small value of the motion vector for a region of the non-I-frame 

indicates that the region is more likely to be similar to the corresponding region in the I- 
frame on which its decoding depends. When its determined the motion vector values are 
relatively large, then the re-mapping strategy for the intensity values of the I-frame is not 
used to re-map the intensity values of the non-I-frame. This can be determined by using a 

1 0 threshold for motion vector values, which can be a constant predetermined value or a 

variable value calculated for each region. Then the I-frame re-mapping strategy is used to 
re-map the intensity values for regions only if the values of the respective motion vectors 
for those regions are below the threshold. Again, those skilled in the art can easily 
determine either a standard predetermined motion vector value threshold for regions in the 

1 5 non-I-frame, or a method to calculate a motion vector value threshold for regions in a non- 
I-frame, that can be used to enhance the non-I-frames. A useful motion vector value 
threshold can be determined, for example, by a simple trial and error process of comparing 
non-I-frames in which different thresholds or threshold calculation methods have been 
applied. 

2 0 Also, consistency in the values of the motion vectors between a region of the non-I- 

frame and its neighboring regions in the non-I-frame indicates that the region is more likely 
to be similar to the corresponding region in the I-frame on which the decoding of the non-I- 
frame depends. When its determined that the motion vector values of neighboring regions 
are substantially inconsistent or dissimilar to the motion vector values in the region, then 
25 the re-mapping strategy for the intensity values of the I-frame is not used to re-map the 
intensity values of the non-I-frame. This determination can be done, for example, by 
determining the average difference between the values of motion vectors for regions and 
the values of the motion vectors of neighboring regions, and then comparing the average 
differences in value to a value consistency threshold. The value consistency threshold can 

3 0 be a constant predetermined value or a variable value calculated for each region. Then the 

I-frame re-mapping strategy is used to re-map the intensity values for regions only if the 
average differences in the values of the motion vectors are below the value consistency 
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threshold. Similarly squares of the value differences or other combinations of the value 
differences or other well know statistical approaches could be used to determine value 
consistency. Again, those skilled in the art can easily determine either a standard 
predetermined value consistency threshold for regions in a frame, or a method to calculate 
5 value consistency thresholds for regions in a non-I-frame, that can be used to enhance the 
non-I-frames. A useful value consistency threshold may be determined, for example, by a 
simple trial and error process of comparing different non-I-frames in which different 
respective value consistency thresholds or threshold calculation methods have been 
applied. 

1 0 Also, consistency of motion vector direction between a region and neighboring 

regions in the non-I-firame, indicate that the non-I-frame region is more likely be similar to 
the corresponding regions in the I-frame on which its decoding depends. When its 
determined that the motion vector direction of neighboring regions are substantially 
inconsistent or dissimilar to the motion vector direction of the region, then the re-mapping 

1 5 strategy for the intensity values of the I-frame for the region is not used to re-map the 
intensity values of the non-I-frame region. This can be determined, for example, by 
determining the average difference between the directions of motion vectors for regions 
and the directions of the motion vectors of neighboring regions, and then comparing the 
average differences in direction to a direction consistency threshold. The direction 

2 0 consistency threshold can be a constant predetermined value or a variable value calculated 
for each region. Then the I-frame re-mapping strategy is used to re-map the intensity 
values for regions only if the average differences in the values of the motion vectors are 
below the direction consistency threshold. Similarly squares of the direction differences or 
other combinations of the direction differences or other well know statistical approaches 

2 5 could be used. Again, those skilled in the art can easily determine either a predetermined 

value for the direction consistency threshold or a method to calculate such a threshold for 
each region in a frame, that can be used to enhance the non-I-frames. A useful direction 
consistency threshold or method to calculate such a threshold can easily be determined, for 
example, by a simple trial and error process of comparing frames in which different 

3 0 thresholds or threshold calculation methods are applied. 

Multiple indications of similarity may be applied to determine whether to apply the 
re-mapping strategy of an I-frame to a subsequent non-I-frame whose decoding depends on 
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the I-frame. Those skilled in the art will know how to develop functions that combine 
multiple indications of similarity to determine whether to apply the I-frame re-mapping 
strategy to the non-I-frame. For example, they can use the I-frame contrast re-mapping 
strategy only when all the indications of similarity meet respective threshold requirements. 
5 Alternatively or in addition, they can determine the differences or relative differences 
between the indications of similarity and their respective thresholds and only apply the I- 
frame contrast re-mapping strategy to the non-I-frame when the total of the differences or 
relative differences (or square of the differences or relative differences) is below a further 
threshold. 

1 0 Those skilled in the art will know how to apply this process to more complex 

dependencies between frames such as subsequent non-I-frames, whose decoding is 
dependent on previous non-I-frames, whose decoding is dependent on I-frames. They can, 
for example, just apply the contrast enhancing re-mapping strategy of the I-frame to such 
subsequent non-I-frames. Alternatively, they can, for example, develop a second contrast 

1 5 enhancing re-mapping strategy for the previous non-I-frame, which can be applied to tihe 
subsequent non-I-frame. 

The decoding of a non-I-frame may be dependent on multiple other frames. Those 
skilled in the art will know how to develop a function that applies the contrast enhancing 
re-mapping strategy of the multiple frames to the non-I-frame. 

20 Figure 2 illustrates the basic components of a video decoder 120 of the invention. 

A video stream of packets containing a group of pictures (GOP) is received at an 
input 122, the first picture in the GOP is an I-frame and a subsequent picture in the GOP is 
a non-I-frame. The video stream may be an MPEG stream as described above. 

A decoding unit 124 decodes the frames of the GOP. The decoding unit provides 

25 the decoded I-frame to a buffer 126, to a processing unit 128. 

Processor 128 uses a region-based intensity analysis to determine a strategy to re- 
map intensity values to change contrast to enhance the I-frame image, and re-maps the 
intensity values of the I-frame in buffer 126 using the re-mapping strategy. The buffer then 
passes the contrast enhanced I-frame to output 132 through summation unit 130. 

3 0 The decoding unit recovers the DC coefficients and the motion vectors for the 

subsequent non-I-frames of the GOP and supplies them to the buffer 126 and processor 
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128. Processor 128 re-maps the original I-frame and the contrast enhanced I-frame 
according to the motion vectors. 

The decoding unit provides the decoded differences between the I-frame and the 
subsequent non-I-frames to the summation unit 130. Depending on a selection criteria, for 
5 each region, the buffer 126 supplies either the motion vector re-mapped I-frame or motion 
vector re-mapped contrast enhanced I-frame to summation unit 130. The summation unit 
combines the decoded differences and the re-mapped enhanced I-frame together to produce 
the decoded subsequent non-I-frame. 

The selection criteria in this specific example for a region is as follows: 

1 0 DC<T1 ; and MW<T2; and MVS<T3 ; and MVO<T4; and 

al(DC-Tl) 2 + a2(MW-T2) 2 + a3(MVS-T3) 2 + a4(MVO-T4) 2 < T5 
Where DC is the value of the DC coefficients for the region; MW is the motion vector 
value for the region; MVS is the average difference between the value of the motion vector 
and the value of the motion vectors of the regions above, below, and to each side of the 

15 region; and MVO is the orthogonality of the motion vector for the region with respect to 
the motion vectors of the regions that border the region; T1-T5 are predetermined 
thresholds; and al-a4 are constants. The constants and thresholds were chosen statistically 
based on comparisons of results by viewers, to most consistently enhance the resulting 
image. 

2 0 Figure 3 shows a set top box 140 of the invention. Tuner 142 selects a video stream 

for a video program from among multiple streams for several different video programs 
provided at input 144. The video decoder 120 of figure 2 decodes the video program and 
provides the decoded program to output 146 which can be directed to a video display e.g. a 
television set. 

2 5 Figure 4 illustrates a DVD player 1 50 of the invention. The video player has a 

motor 152 for rotating a video disc 154. A laser 156 produces a radiation beam 158. A 
servo 160 controls the position of an optical system 162 to scan an information layer of the 
video disc with a focused spot of the radiation beam. The information layer effects the 
beam and reflects or transmits the beam to a radiation detector 164 for detecting the beam 

3 0 after it has been effected by the information layer. Processor 1 66 controls the servo and 

motor and produces a video stream containing encoded information for a group of pictures 
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(GOP) depending on the detection. Then the video decoder of figure decodes the video 
stream and supplies the decoded video stream to an output 168 for connection to a display. 

The processor 166 can be the same processor 128 as in the decoder of figure 2 or an 
additional processor can be provided as shown. 
5 Figure 5 shows a television 200 of the invention. A tuner 142 selects a video stream 

of a video program to be played from a plurality of video streams for respective video 
programs provided to input 144. The decoder 120 of figure 2 decodes the selected video 
program and provides it to display 206. The television may have components of the DVD 
player of figure 4 for playing stored video programs (or recording programs) using the 
1 0 DVD components. 

The invention has been described above in relation to specific example 
embodiments. Those skilled in the art will know how to modify these example 
embodiments within the scope of the invention herein. The scope of the invention is only 
limited by the following claims. 
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